Back

Human Mutation

Wiley

Preprints posted in the last 30 days, ranked by how well they match Human Mutation's content profile, based on 29 papers previously published here. The average preprint has a 0.03% match score for this journal, so anything above that is already an above-average fit.

1
PAVS: A Standardized Database of Phenotype-Associated Variants from Saudi Arabian Rare Disease Patients

Abdelhakim, M.; Althagafi, A.; SCHOFIELD, P.; Hoehndorf, R.

2026-04-06 genetic and genomic medicine 10.64898/2026.04.05.26350189 medRxiv
Top 0.1%
10.3%
Show abstract

Genotype-phenotype databases are essential for variant interpretation and disease gene discovery. Genetic variation differs among human populations, mainly in allele frequencies and haplotype patterns shaped by ancestry and demographic history. Population-specific genotypes can influence traits and disease risk; this makes population specific characterization important. Most existing resources focus on the characterization of a population's genetic background, but do not represent the resulting phenotypes. We have developed PAVS (Phenotype-Associated Variants in Saudi Arabia), a curated, publicly accessible database that integrates 5,132 Saudi clinical cases from four Saudi cohorts and 522 cases from analysis of a mixed-population cohort, together with 1,856 cases from the Deciphering Developmental Disorders study (DDD) and 9,588 literature phenopackets. Each case record describes patient-level phenotypes, encoded with the Human Phenotype Ontology (HPO), and links them to genomic variants, gene identifiers, zygosity, pathogenicity classifications, and disease diagnoses mapped to standardized disease terminologies. The data is represented in Phenopackets format and as a knowledge graph in RDF. Additionally, a web interface provides phenotype-based similarity search, gene and variant browsers, and an HPO hierarchy explorer. We evaluate the utility of the phenotype annotations for gene prioritization using semantic similarity. While there are clear differences to global literature-curated databases, phenotypes in PAVS can successfully rank the correct gene at high rank (ROCAUC: 0.89). PAVS addresses a gap in population-specific genotype-phenotype resources and provides a benchmark for phenotype-driven variant prioritization in under-represented populations.

2
Assessing the clinical significance of a novel rare variant in Loeys-Dietz Syndrome by combining AI-driven modelling and cell biology

Boukrout, N.; Delage, C.; Comptdaer, T.; Arondal, W.; Jemel, A.; Azabou, N.; Bousnina, M.; Mallouki, M.; Sabaouni, N.; Arbi, R.; Kchaou, S.; Ammar, H.; Hantous-Zannad, S.; Jilani, H.; Elaribi, Y.; Benjemaa, L.; Van der Hauwaert, C.; Larrue, R.; CHEOK, M.; Perrais, M.; Lefebvre, B.; Cauffiez, C.; Pottier, N.

2026-03-31 genetic and genomic medicine 10.64898/2026.03.30.26349510 medRxiv
Top 0.1%
9.9%
Show abstract

Loeys-Dietz syndrome (LDS) is an autosomal dominant connective-tissue disorder caused by genetic variants in TGF-{beta} pathway genes, most often TGFBR1/2. While pathogenic TGFBR2 genetic mutations usually cluster in the kinase domain and disrupt SMAD signalling, distinguishing with confidence those with functional impact on TGFBR2 function from rare benign genetic alterations represents one of the most important ongoing challenges for accurate genetic testing. Therefore, there is a pressing need to develop methods that can improve functional variant interpretation. Here, we describe and characterize the functional impact of a novel genetic variant in the TGFBR2 kinase domain (E431K), in a patient with the clinical diagnosis of syndromic genetic aortopathy. We assessed the structural and functional consequences of this variant using AI-driven molecular modelling and in vitro cell-based assays. A high-quality homology-based model of TGFBR2 was generated and computational mutagenesis based on the structural context and evolutionary conservation was used to forecast variant pathogenicity. Relative to wild type, the variant affects protein stability by disrupting intramolecular interactions and likely induces conformational changes that may affect kinase activity and thus TGF-{beta} signalling. This was experimentally confirmed by showing abnormal protein level and alteration of canonical TGF-{beta} pathway activation. Overall, our results establish that the E431K variant leads to aberrant TGF-{beta} signalling and confirm the diagnosis of Loeys-Dietz syndrome type 2 in this patient.

3
The Russian FSHD registry: a first look at the cohort

Kuchina, A.; Sherstyukova, D.; Borovikov, A.; Soloshenko, M.; Zernov, N.; Subbotin, D.; Dadali, E.; Sharkova, I.; Rudenskaya, G.; Kutsev, S.; Skoblov, M.; Murtazina, A.

2026-04-01 genetic and genomic medicine 10.64898/2026.03.31.26349837 medRxiv
Top 0.1%
8.4%
Show abstract

Background: Facioscapulohumeral muscular dystrophy (FSHD) is a common hereditary neuromuscular disorder. The Russian FSHD Patient Registry was established in 2019 following the development of a PCR-based method for genetic confirmation of the diagnosis. Results: The registry included 470 participants (51% male). Genetic confirmation was obtained for 76% (n=356), the remainder were included based on clinical and anamnestic data. Clinical assessment forms and patient-reported questionnaires were analyzed for 310 and 142 patients, respectively. D4Z4 repeat unit (RU) distribution showed patterns consistent with European cohorts, with a predominance of patients with 3 RUs. A moderate inverse correlation was found between RUs number and clinical severity scales. Periscapular weakness was the most common onset manifestation (46.8%), followed by facial weakness (31.6%) which was often unnoticed by patients. The mean age in the Russian cohort was 37.8 years (range 0-97), indicating a younger cohort compared to international data. A delta-adjusted cluster analysis (n=215) identified three distinct trajectories: a classic phenotype with onset before age 14 and early involvement of various muscle groups (n=177), and two clusters characterized by either facial or periscapular onset with slow progression. Conclusion: The Russian FSHD registry provides a comprehensive characterization of a large national cohort, revealing a predominance of patients with 3 D4Z4 repeats and a younger demographic profile compared to international data. Cluster analysis identified three heterogeneous disease trajectories, offering a framework for improved patient stratification.

4
Ancestry-stratified variant classification in monogenic diabetes genes: annotation coverage and differential curation burden

Dario, P.

2026-04-07 genetic and genomic medicine 10.64898/2026.04.06.26350230 medRxiv
Top 0.1%
7.2%
Show abstract

Variant databases ClinVar and gnomAD are the backbone of clinical variant interpretation, but their population composition is skewed toward European ancestry. Whether this skew creates systematic classification disadvantages for non-European patients with monogenic diabetes has not been examined at the database level. ClinVar variant_summary (GRCh38, April 2026; 4,421,188 variants) was cross-referenced with gnomAD v4.0 genome data for 17 monogenic diabetes genes. Annotation coverage and variant classification rates were computed stratified by genetic ancestry group (AFR, AMR, EAS, SAS, MID, NFE, FIN, ASJ). Of 14,691 gnomAD variants across the 17 genes, only 29.7% had any ClinVar classification (range: 12.7%-61.3% by gene). Among classified variants, non-Finnish European (NFE) variants had the highest variant of uncertain significance (VUS) rate (32.1%) and the lowest benign/likely benign fraction (41.6%), consistent with a large submission volume without functional follow-up. African-ancestry (AFR) variants showed the second-highest VUS rate (29.2%), not statistically distinguishable from NFE after Bonferroni correction, while all other non-European groups had significantly lower rates (all p < 0.001). GCK showed a pattern inversion - non-European VUS rate (18.5%) exceeding European (15.0%) - consistent with progressive reclassification in European populations absent in non-European cohorts. Annotation coverage and VUS divergence were uncorrelated (r = -0.15, p = 0.57). The primary equity problem is a 70% annotation gap combined with a non-European curation deficit, not a simple VUS excess. Ancestry-stratified evaluation of ClinGen Variant Curation Expert Panel (VCEP) criteria performance is warranted across disease domains.

5
Identifying disease-causing mechanisms and fundamental biology of neuromuscular disorder genes through genomic feature analysis

Martin, A.; Llanes-Cuesta, M. A.; Hartley, J. N.; Frosk, P.; Drogemoller, B. I.; Wright, G. E. B.

2026-04-22 genetics 10.64898/2026.04.21.719902 medRxiv
Top 0.1%
6.3%
Show abstract

IntroductionNeuromuscular disorders (NMDs) encompass a broad group of conditions that primarily affect the peripheral nervous system. They are often caused by genetic alterations that impair skeletal muscle function and result in debilitating symptoms. Obtaining an accurate molecular diagnosis remains a challenge, potentially because variants in genes that have yet to be identified as causal. We therefore used advanced computational methods to study the genetic architecture of NMDs and to identify key features that distinguish NMD genes from other genes in the broader genome. MethodsCurated genes implicated in NMDs (n = 639; GeneTable of NMDs) were obtained and merged with a comprehensive set of genomic features for human autosomal protein-coding genes. Machine-learning-based feature selection and ranking were performed using Boruta, along with complementary analytical approaches. These analyses were used to identify the most important genic features (n = 134, subcategories: gene complexity, genetic variation, expression patterns, and other general gene traits) for discriminating NMD genes from other genes in the genome ResultsNMD genes exhibit enriched expression in disease-relevant tissues, including skeletal muscle and heart. Additionally, compared with other protein-coding genes, these genes exhibit increased transcriptomic complexity (e.g., longer transcripts and more unique isoforms), contain more short tandem repeats, and show greater variation in conservation across model organisms. ConclusionsThis study identified several key genomic features that may distinguish NMD genes from the rest of the genome. This may enhance the identification of novel causal genes and could ultimately facilitate earlier diagnosis and medical management for affected individuals.

6
Patient iPSC-Derived Cartilage Organoids Reveal Defective ECM Deposition and Altered Chondrogenic Trajectory in Saul-Wilson Syndrome

Mahajan, S.; Ancel, S.; Ascone, G.; Kaur, R.; Torres, J.; Murad, R.; Wang, Y. X.; Ferreira, C. R.; Freeze, H.

2026-04-14 developmental biology 10.64898/2026.04.10.717608 medRxiv
Top 0.1%
6.3%
Show abstract

Saul-Wilson syndrome (SWS) is a skeletal dysplasia characterized by primordial dwarfism and progeroid features caused by a recurrent dominant COG4 variant (p.G516R). We previously showed that this mutation accelerates Golgi retrograde trafficking and disrupts glycosylation of the proteoglycan decorin, while zebrafish models revealed defects in chondrocyte elongation and intercalation. We have also shown that the SW1353 chondrosarcoma cells carrying the SWS variant exhibit reduced secretion of extracellular matrix (ECM) components. While these results indicate a critical function of COG4 in Golgi processing, the developmental process leading to skeletal dysplasia in SWS patients remains unknown. Here, we generated patient-derived iPSC cartilage organoids (SWS organoids), modeling early human chondrogenesis. SWS organoids failed to produce cartilage structures and displayed poor expression of chondrogenic markers. Time-course RNA-seq analysis of the chondrogenic process revealed reduced activation of gene networks involved in skeletal development, ECM organization, ossification, and glycosaminoglycan metabolism. Spatial multiomic analysis of protein and glycosylation by CODEX and GLYPH imaging revealed an altered chondrogenic trajectory, persistence of mesenchymal states, global glycosylation changes, and reduced deposition of chondroitin sulfate proteoglycans. These results indicate that the COG4 mutation disrupts ECM glycosylation and chondrogenic commitment, and that SWS organoids model early defects in cartilage formation underlies impaired skeletal growth in SWS. HighlightsO_LIPatient iPSC-derived cartilage organoids model development defects in Saul-Wilson syndrome C_LIO_LISWS organoids show defective extracellular matrix deposition and attenuated chondrogenic gene expression C_LIO_LIGlycan profiling reveals global glycosylation defects and deficient proteoglycan GAG chains C_LIO_LIAn early developmental impairment in chondrogenesis alters skeletal formation in Saul-Wilson syndrome C_LI

7
LZTS2 Emerges as a Regulator of Craniofacial Development and Modulator of DYRK1A

Cheng, N.; Lima, S.; Litovchick, L. L.; Dickinson, A. J. G.

2026-04-02 developmental biology 10.64898/2026.03.31.715576 medRxiv
Top 0.2%
3.7%
Show abstract

BackgroundPrecise control of DYRK1A dosage is essential for embryonic development, including craniofacial morphogenesis. While LZTS2 is among the most consistently identified DYRK1A-interacting proteins, its roles in embryonic development remain incompletely understood, and its potential contribution to craniofacial development has not been examined. Xenopus laevis was used to test the role of LZTS2 in craniofacial development and its functional relationship with DYRK1A. ResultsLzts2 and Dyrk1a showed overlapping expression during craniofacial development, with both proteins present in developing facial tissues. Knockdown of Lzts2 disrupted craniofacial morphogenesis and reduced expression of the neural crest-associated genes sox9 and pax3. These phenotypes closely resembled those caused by decreasing Dyrk1a function. Sub-phenotypic reductions of Lzts2 and Dyrk1a synergized to produce craniofacial defects, while partial reduction of Lzts2 attenuated aspects of the phenotype caused by Dyrk1a overexpression. Comparative analysis of human phenotypes associated with copy number gains of LZTS2 and DYRK1A revealed striking overlap, consistent with a potential functional interaction between these genes in humans. ConclusionsThese findings identify Lzts2 as a previously unrecognized regulator of craniofacial development and support a functional interaction with Dyrk1a during embryogenesis. Modulating LZTS2 or related regulatory partners may provide a strategy to selectively tune DYRK1A-dependent developmental pathways

8
The Power of Partnership: Democratizing Genetic Prevalence to Empower Patient Advocacy

Baxter, S. M.; Singer-Berk, M.; Glaze, C.; Russell, K.; Grant, R. H.; Groopman, E.; Lee, J.; Watts, N.; Wood, J. C.; Wilson, M.; Rare As One Network, ; Rehm, H. L.; O'Donnell-Luria, A.

2026-03-31 genetic and genomic medicine 10.64898/2026.03.30.26349539 medRxiv
Top 0.2%
3.2%
Show abstract

Introduction: Accurate estimation of disease prevalence is crucial for public health and therapeutic development, but traditional methods are often inaccurate. Genetic prevalence, which estimates the proportion of a population with a causal genotype, using allele frequencies from population data, offers an important alternative. Methods: We partnered with 18 Rare As One patient organizations to estimate genetic prevalence for 22 autosomal recessive conditions using population data from two releases of the Genome Aggregation Database (gnomAD). To standardize and democratize these analyses, we developed the Genetic Prevalence Estimator (GeniE), a publicly available tool, for accessible calculations. Results: Conservative carrier frequencies in gnomAD v4.1 ranged from 1/164 to 1/11,888. The median change in genetic prevalence frequency between v2.1 to v4.1 was 0.806. Partnership with patient advocacy groups provided critical real-world context that refined the interpretation of these estimates. Discussion: These findings highlight that genetic prevalence is not a static figure but a dynamic, evolving measure with important caveats that need to be considered. Our study underscores the necessity of re-evaluations as databases expand. By integrating patient-partnered insights with the GeniE platform, we empower the genomics community to maintain transparent, up-to-date, and actionable data for rare disease advocacy and drug development.

9
Polygenic risk scores enhance the identification of carriers of monogenic forms of idiopathic pulmonary fibrosis

Alonso-Gonzalez, A.; Jaspez, D.; Lorenzo-Salazar, J. M.; Delgado, A.; Quintero-Bacallado, A.; Ma, S.-F.; Strickland, E.; Mychaleckyj, J.; Kim, J. S.; Huang, Y.; Adegunsoye, A.; Oldham, J. M.; Maher, T. M.; Guillen-Guio, B.; Wain, L. V.; Allen, R. J.; Saini, G.; Jenkins, R. G.; Molina-Molina, M.; Zhang, D.; Kim Garcia, C.; Martinez, F. J.; Noth, I.; Flores, C.

2026-04-18 genetic and genomic medicine 10.64898/2026.04.16.26350967 medRxiv
Top 0.2%
2.8%
Show abstract

Background: Idiopathic pulmonary fibrosis (IPF) is a rare disease with a poor prognosis. Disease risk involves rare and common genetic variants. However, an inverse association have been described between them. Accordingly, IPF patients with a higher polygenic risk score (PRS) for IPF are less likely to carry rare deleterious variants and vice versa. Here, we evaluate weather PRS of IPF could serve as an additional criterion to patient prioritisation for rare variant discovery. Methods: We identified carriers based on the presence of rare qualifying variants (QVs) in genes linked to monogenic forms of pulmonary fibrosis in 888 IPF patients from the Pulmonary Fibrosis Foundation Patient Registry (PFF-PR). Genome-wide association study (GWAS) summary statistics from independent cohorts were used to construct a whole-genome PRS (WG-PRS) using a clumping and thresholding method (C+T) and a Bayesian method (SBayesRC). PRS were also derived from 19 known common sentinel IPF variants (Sentinel-PRS). Logistic regression models were used to evaluate associations between PRS and carrier status. Discriminatory performance was evaluated using area under the curve (AUC) analysis, and comparisons were made with DeLong test. Validation was performed in 472 IPF individuals from the UK PROFILE cohort. Results: IPF-PRS were strongly associated with the QVs carrier status: Odds Ratio [OR] 0.65 (95% Confidence Interval [CI] 0.53-0.79) for WG-PRSC+T, OR 0.71 (95% CI 0.59-0.86) for WG-PRSSBayesRC, and OR 0.77 (95% CI 0.63-0.94) for Sentinel-PRS. Adding WG-PRS to the patient personal clinical history improved the prediction of QVs carriers: AUC=0.62 for the clinical model, AUC=0.68 for WG-PRSC+T (DeLong test, p=9.54x10-4) and AUC=0.66 for WG-PRSSBayesRC (DeLong test, p=0.02). Adding of IPF-PRS to clinical variables correctly reclassified 22.8% of carriers when using WG-PRSC+T, 20.8% when using Sentinel-PRS, and 16.7% for WG-PRSSBayesRC. WG-PRSSBayesRC and the Sentinel-PRS also demonstrated improved prediction of QVs carriers in telomere-related genes in PROFILE. Conclusions: Incorporating IPF-PRS into a model based on the patient clinical history improves the identification of QVs carriers. Although the overall discriminatory power was moderate, these findings raise de the possibility of using WG-PRS as useful criterion for rare variant discovery in patients with IPF and enhance decision-making.

10
Designing national programs for expanded carrier screening: Results from a discrete-choice experiment in Singapore

Blythe, R.; Senanayake, S.; Bylstra, Y.; Roberts, J.; Choi, C.; Yeo, M. J.; Goh, J.; Graves, N.; Koh, A. L.; Jamuar, S. S.

2026-04-13 health economics 10.64898/2026.04.09.26350563 medRxiv
Top 0.2%
2.7%
Show abstract

BackgroundCarrier screening for inherited genetic disorders can reduce the burden of conditions that lead to childhood morbidity and mortality, including thalassaemia, cystic fibrosis, and spinal muscular atrophy. To be successful, national carrier screening programs should aim to maximise uptake, which may depend on population preferences for screening characteristics. In this study, we aimed to determine how expanded carrier screening in Singapore should be designed based on operational factors including suggested copayments, wait times, and disorders included in screening panels. MethodsWe elicited stated preferences for the design of a hypothetical national carrier screening program with seven attributes from 500 Singaporeans of reproductive age (18 to 54). A discrete choice experiment was applied using 30 choice tasks with 3 alternatives per task, divided between 3 blocks. The mixed multinomial logit model was used to estimate willingness-to-pay for each attribute level. Predicted uptake for three plausible screening programs was assessed, with copayment amounts from $0 to $1,200 in increments of $30. Impact on the annual national budget was calculated as a function of 25,000 expected eligible couples per year. All costs were reported in 2026 SGD. ResultsRespondents showed the strongest preferences for cost, followed by the number of diseases included in the panel, then wait times, with limited impact of remaining attributes. With no copayments, predicted uptake ranged from 85% [95% CI: 83% to 87%] to 90% [88% to 92%] for the basic and utility-maximising screening programs, respectively. This declined to 61% [56% to 66%] and 69% [65% to 73%] and, respectively, at a copayment of $1,200 per test. The model predicted higher uptake if a selection of screening alternatives were available, compared to a single program. The budget impact was highly dependent on population eligibility, copayments, and couples decision-making processes, but was unlikely to exceed $22.5m [$19.0m to $26.6m] per year unless expanded beyond married couples. ConclusionsThere was high predicted demand for carrier screening even as copayments increased. Successful strategies to improve uptake may include reducing copays and wait times, increasing the number of screening options available to prospective parents, and increasing program eligibility beyond pre-conception married couples.

11
Implementing Reproductive Carrier Screening to Include Diverse Asian Populations: Insights from Singapore

Bylstra, Y.; Yeo Juann, M.; Teo, J. X.; Goh, J.; Choi, C.; Chan, S.; Song, C.; Chew Yin Goh, J.; Chai, N.; Lieviant, J. A.; Toh, H. J.; Chan, S. H.; Blythe, R.; Menezes, M.; Yang, C.; Hodgson, J.; Graves, N.; Sng, J.; Lim, W. W.; Law, H. Y.; Amor, D.; Baynam, G.; Chan, J. K.; Chan, Y. H.; Tan, P.; Ng, I.; Lim, W. K.; Jamuar, S. S.

2026-04-07 genetic and genomic medicine 10.64898/2026.04.07.26350306 medRxiv
Top 0.2%
2.6%
Show abstract

Background As part of Singapore's effort towards precision medicine tailored to Asian diversity, we describe the implementation of a nationwide reproductive carrier screening program. Using a customised 112-gene panel, incorporating population-specific recessive genetic diseases, we outline the overall program design, and initial efforts of community and stakeholder engagement, to inform culturally appropriate implementation. Methods Participants receive culturally tailored online education regarding our reproductive screening program and are provided results with genetic counselling and reproductive options. Community and stakeholder perspectives were assessed through questionnaires and consultations with religious leaders. Results Recruitment is nation-wide, and since initiation of our pilot phase in September 2024, 1,619 couples have registered interest, with 60% uptake of those deemed eligible. Among the 456 couples that have received results to date, four couples (0.9%) were identified to be at increased risk. Community questionnaire responses (n=1002), involving couples who participated in the program as well as the general public, indicated interest is high (59%) across the cohort but awareness, intent to participate and implications for reproductive options differed by sociodemographic factors such as ancestry and religion. Healthcare professional respondents (n=113) acknowledged carrier screening will be routine in medical care, but report limited confidence and resources. Engagement with religious leaders indicated support for the program. Conclusion These early program outcomes and community engagement are guiding the implementation of expanding population-based carrier screening in Singapore, contingent on addressing practical challenges through equitable outreach and professional training.

12
Connective tissue growth in a mouse model of Kosaki overgrowth syndrome is limited by STAT1

Kim, J.; Kwon, H. R.; Berry, W.; Olson, L. E.

2026-04-12 developmental biology 10.64898/2026.04.09.717535 medRxiv
Top 0.3%
2.3%
Show abstract

Mutations in platelet-derived growth factor receptor beta (PDGFR{beta}) cause Kosaki overgrowth syndrome (KOGS). Patients exhibit increased linear growth, craniosynostosis, and thin skin with increased elasticity and scarring. Of the KOGS patients identified to date, three unrelated individuals carried a P584R mutation in the juxtamembrane domain of PDGFR{beta}, resulting in constitutive receptor activation. Due to the limited number of patients, extensive phenotyping and exploration of the molecular basis of disease, including modifier genes, has not been completed. We generated conditional knock-in mice to express mouse PDGFR{beta} with a P583R mutation, corresponding to human P584R, under control of the endogenous Pdgfrb gene. Mutant mice were born at the expected ratio and appeared normal at birth. At 3 weeks of age, mutants began to exhibit connective tissue changes: increased body weight and bone length, craniosynostosis, ectopic bone in the tail and tendons, thin lipodystrophic skin, and high incidence of penile and rectal prolapse. To identify signaling changes caused by mutant PDGFR{beta} signaling, we performed western blotting and phosphoproteomics on dermal fibroblasts. This uncovered increased phosphorylation of PDGFR{beta}, PLC{gamma}, Akt1, Shp2, STAT1, STAT2, STAT3, and STAT5. Analysis of 6,621 proteins and 5,386 phosphopeptides identified upregulation of interferon signaling genes linked to STAT1. In many cell types, STAT1 has tumor-suppressor functions and acts to inhibit cell cycle. We generated Stat1-/- Pdgfrb+/P583R mice to test the contribution of STAT1 to KOGS phenotypes. Stat1-deletion exacerbated overgrowth and calvaria dysmorphogensis, and caused keloid-like skin fibrosis. No phenotypes present in the original Pdgfrb+/P583R mice were reverted to normal after Stat1 deletion. Therefore, the P583R mouse model mirrored KOGS phenotypes and increased activation of multiple PDGFR{beta} signaling mediators; in this context, STAT1 activity opposes PDGFR{beta}-driven overgrowth and fibrosis.

13
Berrylyzer-an Efficient, Traceable, and Lightweight Intelligent Agentic System for Prenatal Genetic Diagnosis

Meng, M.; Liu, L.; Du, Q.; Zhou, X.; Tian, Y.; Sun, K.; Li, N.; Zhang, P.; Lian, X.; Fan, N.; Zhu, N.; Li, S.; Mao, A.; Li, Y.; Zou, G.

2026-04-04 genetic and genomic medicine 10.64898/2026.04.02.26349929 medRxiv
Top 0.3%
2.1%
Show abstract

Background: Artificial intelligence (AI)-driven variant prioritization has demonstrated substantial utility in expediting genetic diagnosis by ranking the most likely causative variants. While a variety of tools have been developed, few address the unique clinical and technical constraints in prenatal genetic diagnosis. Methods: We introduce Berrylyzer, a novel, end-to-end variant prioritization system applied to prenatal diagnosis.Inspired by clinician's reasoning process during variant interpretation, Berrylyzer applies a modular, stepwise scoring architecture that jointly integrates phenotypic and genomic evidence and delivers a ranked list of candidate variants, achieving high computational efficiency without compromising analytical rigor. Moreover, Berrylyzer natively supports both structured ontologies and free-text clinical narratives, enabling flexible integration into diverse clinical environments. Its performance was rigorously evaluated across two independent, real-world prenatal cohorts and benchmarked against three state-of-the-art methods: Xrare, Exomiser, and PhenIX. Results: Across the two datasets, Berrylyzer ranked 56.41% and 58.12% of diagnostic variants first, and achieved recall rates of 94.02% and 97.42% within top 20, respectively. Berrylyzer outperformed Xrare (85.19% and 87.08%), Exomiser (84.90% and 85.98%), and PhenIX (82.05% and 88.93%). Stratified analysis consistently demonstrated superior performance across diverse disease categories, inheritance patterns, and analytical strategies. Notably, Berrylyzer exhibited robustness regardless of phenotype forms, yielding comparable top 20 recall rates for free-text descriptions and standardized terminologies. Conclusion: Berrylyzer represents an accurate, interpretable, and computationally lightweight variant prioritization system for prenatal genetic diagnosis. The superior performance across heterogeneous diagnostic contexts enables it as a practical solution for seamless integration into clinical pipelines, thereby advancing precision medicine in prenatal settings.

14
SafeGene:A Novel Computational Platform for Predictive Genetic Screening of Offspring Disease Risk Using Region-Specific Population Genetics, Mendelian Inheritance Models, and Consanguinity Coefficient Analysis in Saudi Arabia and the Gulf Cooperation Council States

ahmed, a. K.; Rodaini, s.

2026-03-30 genetic and genomic medicine 10.64898/2026.03.28.26349627 medRxiv
Top 0.3%
2.1%
Show abstract

Background: Saudi Arabia bears a disproportionate burden of autosomal recessive genetic disorders, driven by consanguineous marriage rates of 50 to 58% and elevated carrier frequencies for conditions such as sickle cell disease (carrier rate up to 25%), betathalassemia (12%), and spinal muscular atrophy (6%). The existing premarital screening program screens for only two conditions. We developed SafeGene, a computational platform that expands predictive genetic screening to 50+ conditions using region specific population genetics. Methods: SafeGene integrates five risk calculation engines: (1) Mendelian inheritance models for AR, AD, XR, and XD conditions; (2) Hardy Weinberg equilibrium based carrier probability estimation using Saudi, Gulf, and global databases; (3) a six level consanguinity coefficient calculator (F = 0 to 1/8) with risk amplification multipliers; (4) multifactorial polygenic risk models for 12 complex diseases; and (5) maternal age dependent trisomy risk curves. Built using React.js, Node.js/Express, and MongoDB with bilingual Arabic/English support. Results: The platform encompasses 50 genetic conditions across 12 categories. Validation against published Saudi data demonstrated concordance with observed disease frequencies. Economic modeling projects that expanding screening could prevent 2,800 to 4,200 affected births annually, yielding savings of SAR 1.2 to 2.8 billion ($320 to 746 million USD) per year. Conclusions: SafeGene represents a scalable, evidence-based digital health solution for comprehensive genetic screening addressing the unique population genetics of consanguineous Gulf societies. The platform is protected under pending patent applications in South Africa (CIPC) and Saudi Arabia (SAIP).

15
Health Impact Assessment of BRCA1/2 Cascade Screening for the Personalized Prevention of Hereditary Breast and Ovarian Cancers in Italy

Valz Gris, A.; Giacobini, E.; Tricomi, V.; Rumi, F.; Valentini, I.; Cristiano, A.; Testa, S.; Rosano, A.; Pezzullo, A. M.; Boccia, S.

2026-04-15 public and global health 10.64898/2026.04.13.26350758 medRxiv
Top 0.4%
1.7%
Show abstract

Introduction Pathogenic germline variants in the BRCA1 and BRCA2 genes confer a markedly increased risk of breast and ovarian cancer, for which effective preventive strategies are available. Although national and international guidelines recommend BRCA testing and cascade screening of relatives, implementation in Italy remains highly heterogeneous across regions. This study estimates the potential population health and cost impact of achieving full nationwide implementation of BRCA1/2 cascade screening in Italy and identifies key organisational barriers and priority actions for implementation. Methods We conducted a Health Impact Assessment integrating literature review, simulation modelling, and stakeholder consultation. A decision tree and Markov model compared the current heterogeneous implementation of BRCA screening in Italy with an ideal scenario reflecting full adherence to national guidelines, optimal cascade screening, and uptake of preventive strategies. Outcomes included breast and ovarian cancer incidence and mortality, healthcare costs over a lifetime horizon (80 years). Key barriers affecting organisational feasibility, acceptability, and patient well-being were assessed, and a set of priority action recommendations was developed. Results In the ideal scenario, 25,626 eligible cancer patients would undergo BRCA testing annually, identifying 4,254 mutation carriers and enabling cascade testing of 27,650 relatives, of whom 8,682 would be BRCA-positive. Under the current implementation, only 8,807 patients and 2,168 relatives are tested, identifying 948 carriers. Over 30 years, full implementation would prevent 821 cancer cases (- 27.9%) and 1,282 deaths (- 49.7%) compared with the current scenario. While initial expenditures increase due to expanded testing and preventive interventions, cumulative costs decrease over time, resulting in net savings of 5.8 million euros at 30 years and a saving per event avoided (- 2,779 euros). Major implementation barriers include fragmented governance, limited access to genetic counselling, heterogeneous laboratory practices, insufficient professional training, and weak referral pathways. Conclusion Full implementation of BRCA1/2 cascade screening in Italy would yield substantial population health benefits and long-term cost savings. Coordinated national governance, standardised pathways, investment in counselling and workforce capacity, and robust monitoring systems are essential to ensure equitable access and sustainable delivery of personalised cancer prevention. This study demonstrates the value of the HIA methodology for evaluating and guiding genomic prevention policies.

16
Development and fit for purpose validation of a quantitative LC-MS/MS method for heparan sulfate in cerebrospinal fluid as a biomarker for mucopolysaccharidosis type IIIA

Bystrom, C.; Douglass, K.; Gupta, M.

2026-03-30 genetic and genomic medicine 10.64898/2026.03.27.26348847 medRxiv
Top 0.5%
1.3%
Show abstract

Background: Mucopolysaccharidosis type IIIA (MPS IIIA; Sanfilippo syndrome) is a fatal neurodegenerative lysosomal storage disorder caused by impaired degradation of heparan sulfate (HS). Despite rapid advances in gene and enzyme therapies, there remains a critical need for an analytically validated, quantitative biomarker that accurately reflects central nervous system (CNS) substrate burden. Such biomarker would be a valuable tool in assessing disease progression and monitoring therapeutic efficacy. Objective: This study describes the method development, fit for purpose validation, and preliminary clinical application of a quantitative liquid chromatography-mass spectrometry (LC-MS/MS) assay for the HS-derived disaccharide N-sulfoglucosamine-glucuronic acid (GlcNS-GlcUA) in human cerebrospinal fluid (CSF), a critical biomarker for diagnosis, disease monitoring, and regulatory evaluation of emerging MPS IIIA therapies. Methods: A structurally defined GlcNS-GlcUA reference standard and its [13C6]-labeled internal standard were used in a derivatization and detection workflow employing 1-phenyl-3-methyl-5-pyrazolone labeling, and LC-MS/MS. Results: The method exhibited acceptable linearity across 0.005-0.500 nmol/mL (r[&ge;]0.9976), with intra- and inter-assay imprecision [&le;]3.5%CV and accuracy within 95%-110% of nominal concentrations. No matrix or hemolysis interference or carryover was observed, and the analyte remained stable during freeze-thaw storage conditions. Application of the method to 12 CSF samples from patients with MPS IIIA demonstrated quantifiable GlcNS-GlcUA levels ranging from 0.0054 to 0.106 nmol/mL, confirming suitability for clinical and regulatory use. Comparison of the MPS IIIA sample results between the development laboratory and the contract research organization laboratory support robust inter-lab assay transfer. Conclusions: This validated LC-MS/MS method establishes a regulatory-grade quantitative assay for measurement of CSF HS in MPS IIIA. Its high analytical sensitivity and reproducibility enable reliable assessment of CNS substrate reduction and pharmacodynamic response, supporting biomarker-driven therapeutic development and accelerated approval pathways for neuronopathic mucopolysaccharidoses.

17
FRMPD4, a causal gene for intellectual disability and epilepsy, is associated with X-linked non-syndromic hearing loss

Liedtke, D.; Rak, K.; Schrode, K. M.; Hehlert, P.; Chamanrou, N.; Bengl, D.; Katana, R.; Heydaran, S.; Doll, J.; Han, M.; Nanda, I.; Senthilan, P. R.; Juergens, L.; Bieniussa, L.; Voelker, J.; Neuner, C.; Hofrichter, M. A.; Schroeder, J.; Schellens, R. T.; de Vrieze, E.; van Wijk, E.; Zechner, U.; Herms, S.; Hoffmann, P.; Mueller, T.; Dittrich, M.; Bartsch, O.; Krawitz, P. M.; Klopocki, E.; Shehata-Dieler, W.; Maroofian, R.; Wang, T.; Worley, P. F.; Goepfert, M. C.; Galehdari, H.; Lauer, A. M.; Haaf, T.; Vona, B.

2026-03-30 genetic and genomic medicine 10.64898/2026.03.27.26349271 medRxiv
Top 0.5%
1.3%
Show abstract

Abstract Background Understanding the phenotypic spectrum of disease-associated genes is essential for accurate diagnosis and targeted therapy. FRMPD4 (FERM and PDZ Domain Containing 4) has previously been associated with intellectual disability and epilepsy. However, its potential role in non-syndromic hearing loss has not been explored. Methods We performed genetic analysis in two unrelated families presenting with non-syndromic sensorineural hearing loss, identifying maternally inherited missense variants in FRMPD4. Clinical phenotyping included audiological assessment and evaluation for neurodevelopmental involvement. Cross-species expression analyses were conducted in Drosophila, zebrafish, and mouse. Functional characterization included quantitative evaluation of sound-evoked responses in Drosophila nicht gut hoerend (ngh) mutants, assessment of neuronal development and acoustic startle responses in zebrafish loss of function models, and morphological cochlear analyses with auditory brainstem response measurements in knockout mice. Results Three affected males from two unrelated families presented with prelingual, bilaterally symmetrical sensorineural hearing loss, with confirmed congenital onset in one individual and no evidence of neurodevelopmental abnormalities. Cross-species analyses demonstrated evolutionarily conserved expression of FRMPD4 in auditory structures. In Drosophila, quantitative analysis of sound-evoked responses in ngh mutants revealed impaired auditory function. Zebrafish loss of function models exhibited reduced neuronal populations in the otic vesicle and posterior lateral line, abnormal neuromast development, and diminished acoustic startle responses. In mice, Frmpd4 knockout resulted in high-frequency hearing loss and cochlear abnormalities consistent with the human phenotype. Conclusions Our findings expand the phenotypic spectrum of FRMPD4 to include non-syndromic sensorineural hearing loss and establish its evolutionarily conserved role in auditory function. These results have direct implications for genetic diagnosis and variant interpretation in patients with hearing loss.

18
Duplication within 14q32.13 implicates a chimeric CLMN::SYNE3 RNA transcript in cerebellar ataxia

Litster, T. M.; Wilcox, R. A.; Carroll, R.; Gardner, A. E.; Nazri, N. M.; Shoubridge, C. A.; Delatycki, M. B.; Lohmann, K.; Agzarian, M.; Turella Divani, R.; Rafehi, H.; Scott, L.; Monahan, G.; Lamont, P. J.; Ashton, C.; Laing, N. G.; Ravenscroft, G.; Bahlo, M.; Haan, E.; Lockhart, P. J.; Friend, K. L.; Corbett, M. A.; Gecz, J.

2026-04-24 genetic and genomic medicine 10.64898/2026.04.23.26350376 medRxiv
Top 0.5%
1.2%
Show abstract

The spinocerebellar ataxias (SCAs) are a clinically heterogenous group of neurodegenerative disorders that affect movement, vision, speech and balance. Here, we reassign the linkage of SCA30 to 14q32.13 based on a cumulative LOD score >12. Within this interval we identified a 331 kb duplication, absent in population controls and not observed in >800 unrelated individuals with genetically unresolved cerebellar ataxia. RNASeq analysis of patient-derived lymphoblastoid cell lines revealed a splice-mediated chimeric transcript resulting from the duplication event. This transcript joined exon 1 of CLMN to exon 2 of SYNE3. In silico translation predicted that this chimeric transcript would produce a short N-terminal peptide corresponding to exon 1 of CLMN and the usually untranslated region of exon 2 of SYNE3 fused to the complete and in-frame SYNE3 protein. Transient overexpression of SYNE3 or the CLMN::SYNE3 fusion protein, in both HeLa cells and mouse primary cortical neurons, resulted in equivalent cellular outcomes including altered nuclear morphology and chromosomal DNA fragmentation. SYNE3 forms part of the linker of nucleoskeleton and cytoskeleton complex and is not usually expressed in cerebellar Purkyn[e] neurons while, CLMN has a Purkyn[e] specific expression pattern within the brain. Our data suggests that ectopic expression of SYNE3 in cerebellar Purkyn[e] neurons, mediated by the CLMN promoter, leads to cerebellar atrophy and causes spinocerebellar ataxia in the SCA30 family. This is an example of Mendelian disease arising from a novel, chimeric transcript with a likely dominant negative effect. Chimeric transcripts are commonly associated with cancers, but they are not often associated with monogenic disorders. Detection of chimeric transcripts as part of structural variant analysis could increase the genetic diagnostic yield of Mendelian disorders.

19
Unraveling the potential of short and long read sequencing for human genome profiling

Leduc, A.; Bachr, A.; Sandron, F.; Delepine, M.; Delafoy, D.; Fund, C.; Daviaud, C.; Meslage, S.; Turon, V.; Bacq-Daian, D.; Rousseau, F.; Olaso, R.; Deleuze, J.-F.; Gerber, Z.; Meyer, V.

2026-04-22 genomics 10.64898/2026.04.20.719568 medRxiv
Top 0.5%
1.2%
Show abstract

Background: Short read sequencing technologies have dominated the field of human whole genome sequencing in the past years in terms of cost, throughput, and accuracy. However, thanks to recent technological evolution, long read approaches have become increasingly competitive and complementary to short reads. With the gap in the cost per genome closing slowly between both approaches, long reads might replace short read sequencing in future research and clinical applications. Still, comprehensive evaluation is necessary to conclude on the performance and general advantages of each technology. Results: In this study, we compared the latest chemistries of major suppliers of short and long read technologies: Illumina short reads, Illumina Complete Long Reads (ICLR), Pacific Biosciences HiFi reads (PacBio), and Oxford Nanopore Technologies long reads (ONT). Using the HG002 human reference sample and established bioinformatics guidelines, we assessed their variant calling performance against the latest available truth sets at different levels of coverage. For single nucleotide variant detection, all technologies were equivalent. Despite the latest improvements in chemistry, indel calling with ONT continues to lag in accuracy behind other technologies. In contrast, long reads delivered a clear advantage in structural variant detection, surpassing short reads in both accuracy and sensitivity. The hybrid ICLR approach achieved intermediate performance, narrowing the gap between short and long read sequencing. Furthermore, long reads enhanced haplotype-phasing resolution, enabling the phasing of over 80% of the genome. Conclusions: These findings highlight the specific strengths and limitations of recent sequencing technologies, aiding the decision-making in future research projects, technological platforms development, and clinical applications.

20
Neuropathy Assessment and Treatment Patterns in Patients With Hereditary Transthyretin Amyloidosis: A Single-Center Analysis of Stabilizer and Gene Silencer Utilization

Streicher, N. S.; Wubet, H.

2026-04-17 neurology 10.64898/2026.04.15.26350949 medRxiv
Top 0.5%
1.2%
Show abstract

Background: Hereditary transthyretin amyloidosis (hATTR) manifests as cardiomyopathy and/or polyneuropathy. The V142I variant predominantly causes cardiac disease in African Americans, though neurological involvement may be underrecognized. We characterized neuropathy documentation and treatment patterns in a predominantly V142I cohort. Methods: Retrospective review of 54 hATTR patients at a major academic medical center. Neuropathy was classified as: objective (abnormal EMG), possible polyneuropathy (documented symptoms suggestive of polyneuropathy), symptoms only (neuropathic symptoms without specialist evaluation), or unclear. Treatment with stabilizers (tafamidis, acoramidis, diflunisal) and gene silencers (patisiran, vutrisiran, eplontersen) was assessed. Results: Of 54 patients (88.9% African American, 85.2% V142I), 51 (94.4%) had confirmed cardiac involvement. Among cardiac patients, 40/42 eligible (95.2%) received stabilizers. Overall, 16 patients (29.6%) received gene silencers, with 13 (24.1%) receiving both a stabilizer and gene silencer concurrently. Possible neuropathy (objective, possible polyneuropathy, or symptoms) was documented in 30 patients (55.6%). Gene silencer use was highest among those with objective neuropathy (8/17, 47.1%) versus symptoms only (1/10, 10.0%). All three patients without confirmed cardiac disease received gene silencers. Conclusions: In this V142I-predominant cohort with 94.4% cardiac involvement, stabilizer use was high (95.2%) among eligible patients. Over half had possible neuropathy based on clinical documentation, though EMG completion was limited (57.4%). Gene silencer use was associated with objective neuropathy documentation and non-cardiac phenotype. These findings support systematic neurological assessment in hATTR, even when cardiac disease predominates.